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Abstract 


,  Models  of  coin-tossing  have  been  considered  both  in  their  own  aspect  or  as 
specialized  Markov  chains.  ^  A  common  reference  for  both  approaches  is  still  William 
Feller’s  Introduction  to  Probhdility  Theory  and  its  Applications,  vol.  1,  1968.^  A  typical 
model  calls  for  tossing  a  (biased)  coin  until  a  certain  well  defined  stopping  event,  the 
"target  terminates  the  tossing;  the  random  variable  of  interest,  the  “tally,”  is  the 
number  of  tosses.  ^ 

The  stopping  ej^ents  in  Feller  and  in  most  other  references  are  of  two  sorts: 

(a)  A  string  df  heads  or  tails  of  fixed  size  and  order,  e.g.  hhhh,  or  ttttt,  or 
ththht,  etc.;  or 

(b)  A  string  of  r  beads  or  of  tails,  which  ever  shows  up  first. 

A  solution  is  usually  given  as  a  generating  function  for  the  number  of  tosses;  this 
may  require  tedious  algebraic  and  numerical  work  in  determining  the  distribution  and 

moments.  ■: 

/•  « 

In^^is  Report  w6  extencf.the  Feller-type  models  in  several  ways,  all  believed  to  be 

new: 

(1) - We'^  allow:  target  event  of  variable  length,  e.g.  h-odd  #  tails -h.  > 

(2)  We  allovMally  variables  other  than  number  of  tosses,  e.g.  number  of  heads, 
number  of  tails,  number  of  runs,  number  of  doublets  ht  on  way  to  the 
stopping  event,  etc.  ' 

(3)  We  €llowi  vector-valued  tallies,  e.g.  a  vector  composed  of  the  number  of 
heads,  number  of  tails,  numbef  of  runs,  and  number  of  triplets  hht;  our 
solution  provides  joint  and  marginal  distributions,  and  mixed  moments  of  the 
various  components, 

(4)  'A  central  result  shows  how  to  mechanically  transform  an  existing  solution 

(usually  in  the  form  of  a  generating  function)  for  the  number  of  tosses  into  a 
joint  solution  for  the  number  of  heads  and  tails.  We  need  not  even  know  the 
model  to  which  the  original  solution  refers.  This  result  is  of  both  practical 
and  methodological  interest  because  there  are  many  coin-tossing  problems 
worked  out  for  various  models  in  the  literature,  and  these  solutions  can  be 
transformed  into  joint  head-and-tail  solutions  with  minimal  effort.  There  is 
an  indication  that  such  transpositions  can  be  of  value  in  inferential  statistics. 

(5)  We  have  made  a  beginning,  towards  expressing  the  tally  process  in  terms  of 


simpler  building  blocks,  in  particular  geometric  random  variables. 
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(6)  We  show  how  in  many  instances  a  tossing  model  can  be  solved  from  the 
knowledge  of  several  initial  probabilities  but  some  more  work  remains  to  be 
done  to  develop  this  method,  which  is  essentially  the  method  of  indetermined 
coefficients. 

Feller  and  most  authors  base  their  derivations  on  probability  distributions,  whereas 
we  base  ours  on  random  variables  and  their  general  functionals.  Probability  densities 
and  distributions,  moments  and  generating  functions  are  instances  of  such  functionals. 

Our  notation  is  suitable  for  sums  and  linear  combinations,  including  convex 
combinations,  of  random  variables,  and  in  particular  for  tree  structures  because  these 
tend  to  be  rich  in  recursions.  This  notation  has  been  referred  to  as  the  omni-notation, 
and  out  method  as  the  omni-method.  The  gist  of  the  method  consists  in  formulating 
the  equations  of  mixtures  and  balances  for  an  arbitrary  functional  of  the  random 
process  of  interest  in  place  of  the  random  process  itself.  This  arbitrary  functional  can 
also  be  defined  as  the  expectation  of  an  arbitrary  function  of  a  random  process,  as  we 
have  done  in  other  contexts  including  queueing  theory.  We  refer  to  either  definition  as 
the  omni-transform  of  the  process. 

The  resulting  equations  are  called  omni-equations,  and  they  can  be  specialized  to 
distributions  and  generating  functions,  moments,  costs,  Pr(Z-n,  modulo  j)  for  the 
process  or  random  variable  z,  and  to  other  functionals.  Examples  of  such 
specialization  are  provided  as  well  as  examples  of  transposing  generating  functions  into 
their  own  omni-equivalents.  The  simplest  and  most  economical  way  of  deriving  the 
moments  of  a  tally  variable  with  a  known  generating  function  is  usually  to  first 
transpose  this  function  into  an  omni-equation  and  then  to  find  the  moments 
algebraically  or  numerically. 

We  give  examples  where  the  omni-method  applies  easily  whereas  generating 
functions  are  cumbersome  to  use,  e.g.  the  random  variable  ‘‘number  of  heads  minus 
number  of  tails,”  which  ranges  over  positive  and  negative  integers,  and  costs  which 
range  over  fractions,  both  positive  and  negative.  As  an  indication  of  the  power  of  our 
approach,  Hnding  the  probability  that  the  “number  of  heads  minus  number  of  tails”  is 
an  odd  integer  upon  reaching  the  target  hhh  is  a  rather  onerous  combinatorial  problem 
which  reduces  to  several  lines  when  using  the  omni-method. 

Some  of  our  results,  in  particular  the  costing  applications,  are  applicable  to  more 
general  Markov  and  semi-Markov  chains,  but  this  topic  must  be  left  to  a  future  report. 
An  attentive  reader  should  be  able,  however,  to  make  such  an  application  by  oneself. 
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Introduction 

Various  examples  of  coin-tossing  models  are  discussed  in  Feller  (1968).  These 
models  call  for  tossing  a  coin  until  a  certain  target  event  shows  up;  the  random 
variable  of  interest  is  the  number  of  tosses.  The  target  events  are  of  two  kinds: 

(а)  a  string  of  fixed  length  of  heads  and  tails  in  a  fixed  order,  in  particular  a  string  of 
heads  only  or  a  string  of  tails  only;  and  (b)  a  string  of  r  heads  or  p  tails,  whichever 
shows  up  first.  The  solution  is  given  generally  in  the  form  of  a  generating  function  for 
the  number  of  tosses. 

In  this  report  we  extend  these  models  in  several  ways: 

(1)  We  allow  target  events  of  variable  length,  e.g.  h-odd  #tails-h  (where  h  stands  for 
heads  and  t  for  tails)-, 

(2)  We  allow  tally  variables  other  than  the  number  of  tosses,  e.g.  number  of  heads, 
number  of  tails,  number  of  runs,  number  of  doublets  ht  on  way  to  targets,  and 
others,  in  particular  cost  of  tossing  defined  in  various  ways; 

(3)  We  allow  vectors  of  tally  variables,  e.g.  vectors  composed  of  number  of  heads, 
number  of  tails  and  number  of  runs; 

(4)  We  show  how  to  modify  an  existing  solution  (e.g.  in  forms  of  a  generating 
function)  for  the  number  of  heads  into  a  bivariate  solution  for  number  of  heads 
and  number  of  tails;  this  is  of  both  methodological  and  practical  interest  since 
there  are  in  the  literature  various  worked  out  solutions  for  the  number  of  heads 
for  various  targets; 

(5)  We  have  made  a  beginning  towards  expressing  the  tally  process  in  terms  of 
simpler  building  blocks,  in  particular  geometric  random  variables;  and 

(б)  We  show  how,  in  many  cases,  a  tossing  model  can  be  solved  from  the  knowledge 
of  several  known  initial  probabilities,  but  more  work  remains  to  be  done  to  extend 
and  justify  the  method. 

Feller  works  primarily  with  distributions,  we  work  mainly  with  random  variables.  The 
notation  we  have  introduced  is  suitable  for  sums  and  linear  combinations,  in  particular 
convex  combinations,  of  random  variables:  the  omni-functions  and  omni-equations. 
However,  an  omni-equation  can  be  as  easily  specialized  to  an  equation  in  probability 
distributions  as  into  an  equation  for  moments  or  for  generating  functions.  We  have 
shown  examples  where  the  omni-method  is  easily  applicable  but  generating  functions 
are  difficult  to  use,  e.g.  for  “number  of  heads  minus  number  of  tails,”  and  for  costing 
variables  which  can  be  positive  and  negative  and  fractional. 
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This  report  can  be  of  independent  interest  but  also  an  introduction  to  Markov 
processes.  The  special  structure  of  tossing  models  prompted  us  to  prepare  a  separate 
report  on  this  topic. 

We  wish  to  add  that  the  method  of  omni-equations  was  originally  introduced  in 
connection  with  queueing  problems  (cf.  Krakowski  (July  1986,  November  1986,  1987)). 
But  the  quoted  references  are  not  a  prerequisite  for  the  present  report.  In  fact,  our 
use  of  the  omni-method  is  simpler  than  in  queueing  which  involves  differentiation  of 
omni-functions. 

Notation 

We  list  here  the  symbols  used  throughout  most  of  the  report.  Some  symbols  used  only 
locally  are  for  the  most  part  omitted. 

r.v.  ;  random  variable 

h  toss  resulting  in  heads-,  k*h  means  k  successive  heads 
t  :  toss  resulting  in  tails-,  k*t  means  k  successive  tails 

gf  generating  function 

p  =  Pr(h),  q  =  Pr  (t) 

N  =  number  of  tosses;  Nj  =  Pr(N=j) 

H  =  number  of  heads-,  Hj  =  number  of  heads 

T  =  number  of  tails-,  N,  =  number  of  tails 


Nij  =  Pr(H=i,T=j) 

r  =  number  of  reversals  h  -♦  t  and  t  h;  the  number  of  runs  is  R=r+1 
\|;(A)  is  an  arbitrary  function  of  A;  \1;(A,B)  is  an  arbitrary  function  of  A  and  B 
8(j)  =  1  if  j=0,  and  vanishes  otherwise 

8(i,j)=  8(i)8(j)  =  1  if  i=j*0.  and  vanishes  otherwise 

E  is  the  expectation  operator 

Gj  is  a  geometric  r.v.  with  parameter  a,  i.e.  Pr(G3=j)=a(l-a)^“^ 

The  target  event  is  the  event  which  stops  the  tossing  project.  Thus  saying  that  the 
target  is  hhth  means  that  the  tossing  stops  as  soon  as  we  hit  hhth.  A  target  need  not 
be  of  fixed  length;  it  can  be  “h-odd  it  of  t-h”,  or  “hh  or  ttt  whichever  comes  first”. 

A  tossing  model  consists  of  a  tally,  which  is  a  random  variable  or  a  random  vector  of 
interest  and  a  target;  thus  {H,T,r;hht}  means  that  we  stop  tossing  as  soon  as  we  hit  hht 
and  we  count  the  number  of  heads  and  the  number  of  tails  and  the  number  of 
reversals;  clearly  we  are  interested  in  the  trivariate  vector  (H,T,r). 
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Omni-Convention  An  omni-equation,  e.g. 

Ei|;(N)  =  p^\lt(2)  +  pqE\l/(2+N)  +qE\if(l+N) 

can  be  written  as 

\lf(N)  =  p^\j/(2)  +  pq\l/(2+N)  +q\l/(l+N). 

This  means  that,  following  the  omni-convention,  we  omit  the  operator  E  from  the  print 
but  retain  it  in  our  mind’s  eye.  This  convention  adds  to  the  brevity  of  the  notation. 

(It  is  analogous  to  Einstein’s  summation  convention  often  used  in  tensor  and  matrix 
calculus— expectation  is  a  summation.) 

Note:  E\1;(X),  or  \|f(X)  in  the  abbreviated  notation,  can  be  thought  of  as  an  arbitrary 
functional  of  X.  This  simplifies,  e.g.,  congruence  equations  such  as  (1.7) 
below. 
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Omni-Transforms  and  Omni-Equations 


The  expectation  E4f(Z)  of  an  arbitrary  well-behaved  function  of  the  random 
variable  Z  will  be  called  the  omni-transform  of  Z.  (For  the  models  in  this  report, 
functions  with  finite  expectations  are  well-behaved  but  in  other  contexts,  say  in 
queueing  models,  the  existence  of  derivatives  and  their  expectations  is  a  condition  of 
good  behavior.)  It  is  the  essence  of  the  omni-method  that,  along  with  Z,  one 
considers  all  functions  v|r(Z). 

The  simplest  omni-equation  is 

Eil/(A)  =  EiKB)  (a) 

which  says  that  the  random  variables  A  and  B  have  the  same  distribution,  be  they 
dependent  or  independent.  If  (a)  holds,  if  X,  is  independent  of  A  and  of  B,  and  if 
EiKXj)  =  EijrCXj)  then 

E\1/(A+X,)  =  EtKB+Xj)  (b) 

and  this  is  usually  written  as 

Ev|/(A+X)  =  E\|f(B+X)  (c) 

Contrarywise,  if  (c)  holds  then  so  does  (a).  One  has  to  be  careful  in  (b)  that  Xi  and 
X2  are  independent,  respectively,  of  A  and  B.  Thus  one  must  not  take  Xi  =  X2  =  -A 
and  write  (b)  as 

E\l;(A-A)  =  E\l/(B-A)  =  E\|/(0)  (d) 

What  we  said  about  the  simplest  omni-equation  (a)  holds  for  all  omni-equations. 
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Section  1.  The  Project  {h} 

We  start  with  the  simplest  project  in  coin-tossing;  toss  a  (p,q)  coin  till  the  target  h 
(“heads”)  shows  up. 

The  project  {h}  can  be  represented  by  the  infinite  tree  of  Fig.  1: 


Fig.  1.1 


where  p  =  Pr(h)  and  q  =  1-p  =  Pr(t)  in  a  single  toss.  We  assume  that  successive 
tosses  are  stochastically  independent. 

We  now  replace  the  above  infinite  tree  by  the  finite  recursion  tree  in  Fig.  1.2. 
(Since  each  link  pointing  towards  h  has  probability  p  and  each  link  pointing  towards  t 
has  probability  q  we  can  omit  hte  p’s  and  q’s  from  the  diagrams.) 


(hi  stop 


(0  renewal 


[  ]  denotes  stop 


(  )  denotes  renewal 


Fig.  1.2 


If  we  are  interested  in  EN  we  read  from  Fig.  1.2 

EN  =  p.l  +  qE(l+N*)  •  (1.1) 

where  N*  is  the  remaining  numer  of  tosses  following  the  renewal  toss  (t).  Although  N 
and  N*  are  dependenet  (in  fact  N*  is  a  subsequence  of  N)  they  are  statistical  replicas 
of  each  and  EN  =  EN*.  Hence  no  confusion  should  result  if  we  suppress  the  asterisk 
in  (1.1)  and  write  it  as 

EN  *  p.l  +  qE(l+N)  (1.2) 

From  (1.2)  we  get  N  =  1/p. 

2 

Similarly,  if  we  are  interested  in  EN  ,  we  read  from  Fig.  1.2 

EN^  =  p.l^  +  qE(l+N)^  (1.2) 
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Similarly  for  EN^  or  EN^  In  fact  for  any  arbitrary  function  \KN)  of  N  we  read 
from  Fig.  1.2 

E\lfN  =  p\lf(l)  +  qE\lf(l+N)  (1.3) 

an  omni-equation,  so-called,  for  the  model  {N;  h}.  The  omni-convention  improves  the 
esthetics  of  the  typography  by  removing  the  E  but  making  it  understood  so  that  (1.3a) 
says  the  same  thing  as  (1.3). 


\|;(N)  =  p\l/(l)  +  q\lt(l+N) 


(1.3a) 


(Equation  (1.3a))  can  be  also  read  as  saying  “the  r.v.  N  is  linear  combination,  with 
respective  coefficients  p  and  q,  of  the  r.v.  1  and  the  r.v.  1+N*  where  N*  is  a  statistical 
replica  of  N.  Thus  omni-equations  provide  a  notation  for  sums  and  combinations  (in 
particular  convex  mixtures). 

We  will  refer  to  E\lf(N)  as  the  omni-transform  of  N.  This  transform  derives  its 
flexibility  from  the  arbitrariness  of  \lf(  );  and  its  ease  of  operations  from  the  fact  that 
expectations  are  simple  to  apply  to  sums  and  mixtures  of  r.v.’s. 

By  setting  \1/(N)  =  we  get  from  (1.3)  or  (1.3a) 

EZN  =  pZ  +  qzEzN 


EzN= 

l-qz 


the  gf  of  N. 

By  setting  \lf(N)  =  8(N-j),  where  \1/G)  =  1  if  j=0  and  \|;(J)=0  otherwise,  we  obtain 
Pr(N=j)  =  pPr(l=j)  +  qPr(l+N=j)  (l.S; 

since  E8(N-j)  =  Pr(N=j). 

Denoting  =  Pr(N=j)  we  write  (1.6)  as 

N.  =  p8(j-l)  +  qNj_i  (1.5a) 

Of  course  we  can  read  (1.5)  directly  from  Fig.  1.2. 

From  (1.5a)  we  get  successively  Nj  =  p,  Nj  =  qp,  Nj  =  q  p  and  N.  =  q^p. 
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Equation  (1.5a)  is  recognized  as  the  recursion  of  a  geometric  distribution  with 
parameter  p;  we  denote  the  corresponding  r.v.  as  Gp  so  that,  as  in  (1.3a) 

\1/(G_)  =  p\l;(l)  +  q4f(l+G  )  (1. 


We  can  say  that  the  N  in  (1.3)  or  (1.3a)  satisfies 
\KN)  =  vl;(Gp) 

By  setting  \if(N)  =  28(N-j-ik)  we  get  from  (1.5)  the  congruence  equation 
i 

Pr(Nsj,  mod  k)  =  pPr(l=j,  mod  k)  + 

+  qPr(l+N3ij,  mod  k) 

Setting  k=2  and  j  =1  in  (1.7)  we  have,  with 

a  =  Pr(N=l,  mod  2)  =  Pr(N  is  odd) 

a  =  PI  +  q(l-Oii) 


(1.6a) 


Pr(N  is  odd)  = 


(1.7a) 


The  question  suggests  itself.  Are  there  other  r.v.’s  whose  omni-equations  are 
represented  by  the  renewal  tree  of  Fig.  1.2?  Yes,  there  are,  and  this  is  a  major  reason 
why  this  paper  is  written.  We  will  say  that  sucl)  r.v.’s  are  isographic  with  N  with 
respect  to  the  renewal  tree  of  Fig.  1.2. 


The  Models  {H;  h},  {T;  h},  and  {H,T;  h} 

Let  H  =  number  of  heads  among  the  N  tosses  (we  know  that  H=1  but  let  go  on  with  a 
poker  face  as  if  we  did  not  know  it);  and  let  T  =  number  of  tails  among  the  N  tosses. 
Of  course,  H+T  =  N.  The  omni-equations  for  H  and  for  T  can  be  read  off  from  the 
tree  of  Fig.  1.2  which  represents  H  and  T  as  well  as  N.  We  thus  obtain 


vl;(H)  =  p\l/(l)  +  q\j;(H) 
t1;(T)  =  p\l;(0)  +  q\k(l+T) 


(1.8a) 

(1.8b) 


VX' 


In  fact,  the  vector  (H,T)  is  also  isographic  with  N  with  respect  to  the  tree  of  Fig.  2.1 
so  that  we  can  combine  (1.8a)  and  (1.8b)  into  a  single  omni-equation  for  (H,T)  in  the 
model  {H,  T;  h}: 

xl;(H,T)  =  pvl;(l,0)  +  qi^HJ+T)  (1.9) 

We  can  also  read  off  equation  (1.9)  directly  from  Fig.  1.2  rather  than  fuse  (1.8a)  and 
(1.8b).  Both  methods  are  instructive. 

When  considered  separately  equation  (1.8a)  can  be  simplified  to 

(l-q)\|/(H)  =  pvl/(l),  and  since  1-q  =  p,  to  \1/(H)  =  \i/(l); 

and  so  H  =  1 

as  it  ought  to  be.  This  result  is  not  enlightening  but  reassuring  as  to  the  trustiness  of 
the  method. 

From  (1.9)  we  can  derive  the  joint  distribution,  the  mixed  moments  and  the  joint 
gf  for  (H.T).  However,  the  outcome  is  known  since  H  =  1.  Such  exercises  are  of 
interest  for  more  complex  models  where  the  results  may  be  new. 

Duration  of  the  Toss  Project.  Another  omni-equation  which  is  isographic  to  N  in  Fig. 
1.2  arises  when  we  ascribe  random  durations  to  the  tosses.  Let 

x  =  duration  of  an  h-toss 
y  *  duration  of  a  t-toss 
T  =  aggregate  duration  of  all  tosses 
From  Fig.  1.2  we  read 

\|;(t)  =  p\l/(x)  +  q\l/(y+T),  (1.10) 

a  convolution  equation  for  t.  From  (1.10)  we  can  derive  successive  moments  of  t,  the 
Laplace  transform  of  t,  etc.  The  derivation  of  is  painless.  With  i|/(t)  =  t  we  get 

T  =  px  +  ^y  (1.11) 

to  derive  the  Laplace  Transform  for  t  let  \1;(t)  =  From  (1.10)  we  get 


We  now  ascribe  a  discrete  cost  to  the  tossing  process  in  the  following  manner. 

Let  X  and  y  still  denote  the  respective  durations  of  an  h-toss  and  of  a  t-toss.  We 
introduce  a  Poisson  source  of  intensity  X.  which  is  independent  of  the  tossing  process. 
Let  the  discrete  costs  be 

-  number  of  Poisson  arrivals  during  x 
#y  =  number  of  Poisson  arrivals  during  y 
#T  =  number  of  Poisson  arrivals  during  t 

The  Poisson  operator  applied  to  each  summand  in  each  argument  in  (1.10)  results 
in 

=  pil/(^x)  +  q\l/(^y  +  ^t).  (1-13) 

This  is  a  valid  equation  since  the  Poisson  operator  is  additive  over  nonoverlapping 
intervals,  say  A  and  B: 

#(AuB)  =  #A  +  #B  (1.14) 

It  is  known  that  E#x  =  \Ex  and  E#y  =  XEy;  and  that 

Pr(/^x  =  j)  =  E  (1.15) 

(cf.  Gross  and  Harris,  Section  5.1.) 

From  (1.13)  we  can  find  the  distribution  and  moments  of  #t.  The  expectation  of 
can  be  obtained  also  from  (1.11): 

E#t  =  XEt  =  XEx  +  ^  XEy  (116) 

Equations  of  the  type  (1.10)  and  (1.13)  found  an  application  in  the  theory  of 
M/G/1  queues  and  their  variants.  (Cf.  Krakowski  July  1986;  November  1986;  1987.) 

A  costing  Model  Consider  the  following  costing  model.  Toss  a  coin  till  h  results.  The 
cost  of  tossing  an  h  is  .A  and  the  cost  of  tossing  a  t  is  B;  A  and  B  are  random 
variables. 


The  corresponding  omni-equation  is,  denoting  by  K  the  cost  of  tossing  till  h, 
\1;(K)  =  p\l/(A)  +  qilf(B+K) 

Setting  iKK)  =  K  gives  K  =  A  +  ^  B  . 

Since  the  costs  A  and  B  can  be  negative  so  can  K;  hence  the  method  of 
generating  functions  may  not  be  applicable. 
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Section  2  The  Project  {hh} 


We  toss  a  (p,q)  coin  till  we  reach  hh,  i.e.  two  successive  “heads.”  We  start  with 
the  model  {N;hh}  where  we  tally  N,  the  number  of  tosses  in  reaching  the  target 
pattern  hh.  The  r.v.  N  has  been  the  most  commonly  discussed  r.v.  in  models  of 
coin-tossing.  Our  interests,  however,  will  not  be  limited  to  N. 


The  simplest  renewal  tree  for  {;hh}  is 


[  ]  denotes  stopping  toss 
(  )  denotes  renewal  toss 


Fig.  2.1 


and  its  associated  omni-equation  is 


t|/(N)  =  p  \|r(2)  +  pq\if(2+N)  +  q\|/(l+N) 


(The  omni-convention  applies). 


Note  We  referred  to  Fig.  2.1  as  the  simplest  renewal  tree.  A  more  extended  renewal 
tree  for  {N;hh}  is  shown  in  Fig.  2.1a: 


—  (t)  ^  (h) 
(t) 

t<—  (t) 


Fig.  2.1a 


The  omni-equation  for  this  tree  is 


i|f(N)  »  p^\|f(2)  +  pq\l/(2+N)  +  qp^\lt(3)  + 


+  qpqt|;(3+N)  +  q  t|f(2+N) 


(2.1a) 


By  setting  \lf(N)  =  N  we  specialize  (2.1)  to 


N  a  2p^  +  2pq  +  pqN  +  q  +  qN 


m 


from  which  follows 


N  =  -1  +  J. ;  for  ps*q=l/2  N  =  6  (2-2) 

p2  p 

When  \1;(N)  =  (N-N)^  equation  (2.1)  becomes 
E(N-t^^  =  p^2-N)^  +  pqE(2+N-N)^  +  qE(l+N-N)^ 

and,  since  E(N-N)=0  we  get  eventually,  with  N  from  (2.2) 

®N  =  ^  *  P  -  ^2  -  P  *  ’’“''P  °N  "  22  (2.3) 

With  \|f(N)  =  N*^,  k  =  2,  3,  4  etc.  we  can  get  successive  moments  of  N  from  (2.1). 
^tting  i|;(N)  =  we  get  from  (2.1) 

Ez^  =  p^z^  +  pqz^Ez^  +  qzEz^ 
from  which  we  have  the  gf  of  N: 


EzN=  _P^— 
l-qz-pqz2 


(2.4) 


Letting  \1/(N)  =  8(N-j)  we  get,  since  E8(N-j)  =  Pr(N=j), 

Pr(N=j)  =  p^Pr(2=j)  +  pqPr(2+N=j)  +  qPr(l+N=j)  (2.5) 

which,  with 

Nj  ^  Pr  (N=j) 


is 

N.  -  p'8a-2)  +  pqNj.2  +  qNj_i  (2.5a) 

a  recursive  relation  for  Nj;  of  course  N.  =  0  for  j  <  2. 

[We  can  also  read  (2.5)  directly  from  Fig.  2.1.) 

It  is  easy  to  see  directly  from  Fig.  2.1,  or  by  properly  specializing  \j^(N),  that  we  have 
a  relation  analogeous  to  (2.5)  in  which  Pr(N»j,  mod  k)  takes  the  place  of  Pr(N=j). 


In  particular,  when  k=2  we  have  Pr(N»l,  mod  2)  =  Pr(N  is  odd) 


Pr(NaiO  mod  2)  =  Pr(N  is  even) 

Let  for  brevity, 

a  =  Pr(N  is  odd),  3  =  1  -  =  Pr(N  is  even) 

When  \|f(N)  =  a  we  have  \|/(2+N)  =  a  and  \)/(l+N)  =  1-c^;  (2.1)  becomes 
2 

ci{  =  p  -O  +  pq  •  (y  +  q(l-a) 
and  thus 


and  3  =  I-o^  = 


(2.6a) 


Of  course,  we  can  read  (2.6)  directly  from  Fig.  2.1.  When  p=q=l/2  we  have  a  =  2/5 
and'3  =  3/5. 

We  turn  now  to  the  processes 

H  a  number  of  h-tosses  upon  reaching  the  stop  (here  the  second  h  in  hh) 

T  a  number  of  t-tosses  upon  reaching  the  stop 

Each  of  these  processes  is  clearly  isographic  with  N  with  respect  to  the  renewal  tree  in 
Fig.  2.1.  The  omni-equations  for  H  and  for  T  are,  as  read  from  Fig.  2.1, 

\|f(H)  a  q\|;(H)  +  pq\l/(l+H)  +  p^\|;(2)  for  {H;hh}  (2.7) 

UT)  a  q\|;(l+T)  +  pq\lf(l=T)  +  p^4f(0)  for  {T;hh}  (2.8) 

Since  H  and  T  are  each  isographic  with  N  with  respect  to  Fig.  2.1  then  so  is  the  vector 
(H,T).  Hence  we  can  fuse  (2.7)  and  (2.8)  into  a  bivariate  omni-equation  in  H  and  T 
(with  the  omni -convention  in  force) 

v|;(H,T)  a  qv|,(H,  1+T)  +  pqid+H,  1+T)  +  p'v|/(2,0)  (2.9) 

where  \KH,T)  is  an  arbitrary  function  of  H  and  T  in  the  model  {H,T;hh}. 


When  standing  alone  equation  (2.7)  can  be  simplified  to 
\|f(H)  a  p\|;(2)  +  q\|;(l+H) 


(2.7a) 


and  equation  (2.8)  to 

\|r(T)  *  p'\lf(0)  +  q'\|;(l+T)  where  p'  =  p^  and  q'  *  1-p^ 

Equation  (2.7a)  is  equivalent  to 
vl;(H-l)  =  p^l;(l)  +  qTlf(H-l+l) 
which,  upon  comparison  with  (1.6)  implies  that  in  {H;  hh} 
i|f(H-l)  =  >lf(Gp)  and  vl;(H)  =  il/(l+Gp) 

which  says  that  the  r.v.  H-1  is  a  geometric  r.v.  with  parameter  p  for  {H;hh}. 
Similarly  (2.8a)  implies  that  in  {T;  hh} 

\Jr(T+l)  =  t{f(Gp' )  where  p'  =  p^ 

which  says  that  the  r.v.  T+1  is  a  geometric  r.v.  with  parameter  p'  =  1-p^. 
From  (2.7b)  and  (2.8b)  we  get 

H  =  1  +  G  a  1  +  1/p 


T  =  -1  +  Gp'  =  -1  +  l/p'  =  -1  +  1/p' 


(2.8a) 


(2.7b) 


(2.8b) 


(2.10) 


(2.11) 


and  we  verify  that  H  +  T  =  l/p  +  i/p  =  N  as  in  (2.2).  From  (2.7a)  and  (2.8a)  we 
can  find  the  moments  and  distributions  of  H  and  of  T. 

We  turn  now  to  the  omni-equation  (2.9)  in  order  to  explore  the  joint  behavior  of 
H  and  of  T.  Let  iKH.T)  =  HT  and  we  obtain 

m  aq[H  +  Kf]  +pq[l  +  H  +  Kf  ]  +  p^  O 
from  which  we  get,  aided  by  (2.10)  and  2.11), 


HT  =  2/p  -  l/p  -  1 
To  get  the  joint  probabilities 
Ni,  i  Pr(H=i,  T=j) 


(2.12) 


Wi* 


we  set  \1/(H,T)  =  E8(H-i)8(T-j)  =  Pr(H=I,T=j)  and  obtain  from  (2.9) 
Pr(H=i,T=j)  =  qPr(H=i,l+T=j)  +  pqPr(l+H=i,l+T=j)  +  p^Pr(i=2,  j=0) 


(2.13) 


Nij  =  qNi.j-1  pqNi-i,j_i  +  p^8(i-2)80)  (2.13a) 

Of  course,  Njj  =  0  when  i  <  2  or  j  <  0. 

From  (2.13a)  we  have  N2.0  =  p^  ;  N2.1  =  qN2.o  =  P^q  :  N2.j  =  pV:  Ni.o  =  0 
for  i<2  ;  Na.i  =  q-0  +  pqN2,o  =  P^q  :  etc. 

To  get  the  bivariate  gf  for  H  and  T  we  set 
\lt(H,T)  =  z”  ■  zj; 

thus  (2.9)  becomes 

Ez^  •  EzJ  =  Ez^  -E  zj  +  pqzj  •  Zj  *  Ez^  •  EzJ  +  pjZj^ 
and  we  get  the  sought  for  generating  function 

D^Z  ^ 

EzHzT  , 

1  2  l-qZj-pqZjZj  ''  ^ 


(2.14) 


It  is  often  the  case  that  the  simplest  way  to  derive  a  gf  is  to  specialize  an 
omni-equation  as  read  from  a  recursion  tree  (or  a  set  of  such  trees,  as  we  shall  see 
soon)  or  from  some  other  structural  graph.  This  has  certainly  been  the  case  in  our 
problems.  Introduced  as  a  prop  for  the  span  and  the  focus  of  attention,  the  recursion 
trees  turned  out  to  be  of  essence  in  the  structure  and  classification  and  analysis  of  the 
models.  Thus  the  notion  of  isography  is  a  direct  result  of  this  deepened  understanding 
and  so  is  the  ability  to  formulate  multivariate  problems  (e.g.  H  and  T  jointly)  almost 
as  easily  as  univariate  problems.  Problems  which  appear  weighty  exercises  in 
combinatorics  (e.g.,  the  probability  that  H  is  odd  and  T  is  even)  are  routinely  treated 
as  special  cases  of  the  relevant  omni-equation.  But  more  can  be  done  siill.  Consider, 
e.g.,  the  r.v. 

d 

M=  number  of  doublets  ht  obtained  on  the  way  to  hh.  Note  that  the  M-process 
is  isographic  to  N,  and  we  get  from  the  recursion  tree  in  Fig.  3.1: 


I 

J 

i 


I 
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\KM)  =  q\l/(M)  +  pq\l/(l+M)  +  p^\lr(0) 


(2.15) 


Equation  (2.15)  can  be  fused  with  (2.1)  into  a  bivariate  equation 

\1;(N,  M)  =  q\|f(l+N,  M)  +  pq\ir(2+N.H-M)  +  p^\l/(2,0)  (2.16) 

or  it  can  be  fused  with  (2.9)  to  obtain  the  trivariate  equation 

tJ;(H,T,M)  =  qTir(H.l+T,M)  +  pq\|/(l+H.l+T.l+M)  +  p^\|;(2.0,0)  (2.17) 

Equation  (2.15)  when  considered  apart  from  its  relation  to  other  isographic  random 
variables  can  be  simplified  to 

\|;(M)  =  p\K0)  +  q\l;(l+M)  (2.15a) 

Equation  (2.15a)  can  be  written,  by  adding  1  to  each  argument, 

\|/(1+M)  =  p\lf(l)  +  q\lf(l+l+M)  and  we  recognize  that 

\1;(1+M)  =  \|f(Gp)  (2.15b) 


which  implies  M  =  Gp  -  I  =  l/p  -  l  =  q/p;  when  p  =  q  »  1/2  we  have  M  «  1. 

In  order  to  show  that  isography  should  always  be  specified  with  respect  to  a 
definite  renewal  tree  consider  the  model  {L;hh}  where 

L  =  number  of  non-overlapping  triplets  hth  on  way  to  hh.  “Non-overlapping” 
means,  e.g.,  that  the  sequence  hhthth  contains  only  one  tallied  hth  triplet  (but  the  last 
of  the  six  indicated  tosses  is  an  h  capable  of  starting  a  new  tallied  hth).  The  tree  of 
Fig.  1.2  is  clearly  not  a  renewal  tree  for  the  model  {L;  hh).  However,  let  us  extend 
that  tree  as  has  been  done  in  Fig.  1.1: 


[h] 


(t) 


(t) 


Fig.  2.2 


The  tree  of  Fig.  2.2  is  clearly  a  renewal  tree  for  both  N  and  L  which  are  therefore 
isographic  with  respect  to  Fig.  2.2  but  not  with  respect  to  Fig.  1.2.  (Note  that  H,  T,  N 
and  L  are  isographic  with  respect  to  Fig.  2.2.) 


Let  us  write  down  the  bivariate  omni-equation  for  (N,L)  directly  from  Fig.  2.2. 

We  have,  and  the  reader  should  verify  it, 

\1;(N,L)  =  q\|f(l+N,L)  +  pq^  \|f(3+N,L)  +  p^q^  \|/(4+N,l+L)  + 

p'q  t|/(4,l)  +  p'  i(2,0)  (2.18) 

From  (2.18)  we  obtain  an  omni-equation  for  N  alone  by  disregarding  the  second 
argument  in  \1;(N,L):  il/(N,L)  -*  il/(N);  and  we  obtain  an  omni-equation  for  L  by 
disregarding  the  first  argument  in  \|f(N.L):  iKN,L)  ->■  i^L),  The  equation  for  L  is  in  its 
nascent  form  (i.e.  as  read  off  from  its  revewal  tree  in  Fig.  2.2). 

v|;(L)  *  qi^L)  +  pq^Tl/(L)  +  p^q^v^l+L)  +  p^q\lf(l)  +  p^Tif(O)  (2.19) 

which  is  simplified  to 

*(L)  -  WltL)  +  .Wl)  ^tCO)  (2.20) 

From  (2.20)  we  can  derive  the  moments  and  the  distribution  of  L.  Without  deriving 
this  distribution  one  can  tell  from  the  form  of  (2.20)  that  the  sequence  of  the  Lj, 

where  »  Pr(L»j),  is  geometrically  progressing  starting  wivh  Lj  and  that  Lq  »  l/(l+q). 
The  expected  value  L  is 

L  »  q/(l+pq)  ;  for  p  =  q  =  1/2  we  have  L  =  2/5  (2.21) 

As  in  Section  1,  we  can  ascribe  durations  to  the  h-tosses  and  the  t-tosses;  and  we 
can  apply  the  Poisson  operator  to  these  durations  but  we  will  not  pursue  this  matter 
further. 

In  Section  5  we  will  derive,  in  more  than  one  way,  the  omni-equation  for  the 
model  {N;  k*h}  and  the  model  {H,T;  k*h}  where  k*N  stands  for  a  string  of  k  “heads”. 

Note  What  can  we  say  about  {N,L;}  based  on  the  tree  of  Fig.  2.1?  Well,  we  can  still 
write 

\|;(L)  «  p^vj;(0)  +  pq\j/(Lht)  +  qvl/(L),  (2.22) 

where  Lht  is  the  conditional  random  variable  denoting  the  number  of  triplets  hth  on 
way  to  hh  provided  that  the  first  two  tosses  were  h  and  t. 
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But  (2.22)  is  one  equation  in  two  random  variables  and  hence  incomplete.  It  is 
easy  to  fuse  N  and  L  into  vl/(N,L).  When  standing  alone  (2.22)  can  be  simplified  to 

\|/(L)  =  p\l;(0)  +  q\|/(Lht)  (2.22a) 

We  wish  to  point  out,  without  pursuing  the  matter  in  depth,  that  Fig.  2.1  can  be 
generalized  to 


[hi 
(t) 

Fig.  2.3 

whose  omni-equation  for  (H,T)  is 

\if(H,T)  =  q,  \1;(H,1+T)  +  p,q2  il;(l+H,l+T)  +  pjpj  \if(2,0)  (2.23) 

where  Pr(“heads”)  =  pj  for  a  toss  right  after  an  h-toss 

=  p,  otherwise 

and  Pr(“tails”)  =  qj  =  1  -  P2  for  a  toss  right  after  an  h-toss 

qj  =  1  -  Pj  otherwise 

We  are  now  ready  to  deal  with  models  which  cannot  be  graphed  by  a  single  finite 
renewal  tree.  Such  a  model  is  {r;  hh}  where  r  is  the  number  of  reversals  h  -» t  or 
t  -♦  h;  the  best  way  to  see  it  is  to  try  to  draw  a  renewal  tree  for  this  model. 
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Section  3  Furcation  Method 


Consider  now  the  project  {ht}  where  we  toss  a  (p.q)  coin  till  the  couplet  ht,  the  target, 
is  realized.  The  renewal  tree  for  this  project  is  shown  in  Fig.  3.1  below: 


h - 

.h - [tj 

—  (t)  renewal 


h 


-It] 

stop 


stop 


Fig.  3.1 


This  tree  is  infinite  and  cannot  be  condensed  into  a  single  finite  renewal  tree. 
(Though  infinite,  the  above  tree  is  still  easy  to  handle  for  {N;ht}  and  some  other 
models  because  of  its  simple  geometry  but  we  will  not  follow  this  possibility  since  it 
represents  a  special  case  only.) 

Let  us  now  consider  the  first  toss  as  shown  in  Fig.  3.2: 


h  mid-toss 
(t)  renewal 


Fig.  3.2 


The  t-toss  is  a  renewal,  as  indicated.  But  the  h-toss  is  an  intermediate  toss,  that  is  a 
toss  between  a  renewal  (or  beginning)  and  a  stop.  We  will  refer  to  such  a  toss  as  a 
mid-toss. 

As  graphed  in  Fig.  3.2,  the  first  toss  provides  us  with  partial  information  which 
can  be  stated  as  an  omni-equation.  For  the  model  {N;ht}  this  equation  is 

^  h 

^KN)  =  p\|t(l+Nh)  +  q\k(l+N)  (3.1) 

where  is  the  remaining  number  of  tosses  provided  that  we  already  have  “heads”; 
thus,  Nj,  is  a  conditional  r.v.  The  furcula  to  the  left  of  the  equation  is  a  useful 
mnemonic  device  and  we  will  use  it  repeatedly.  In  (3.1)  we  have 'used  the  fact  that 
\l/(Nt)  =  \|/(N)  since  this  toss  is  a  renewal. 

Let  us  continue  with  a  toss  conditioned  upon  our  having  just  tossed  heads.  Fig. 
3.3  represents  this  conditional  toss. 
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Fig.  3.3 


The  omni-equation  corresponding  to  Fig.  3.3  is 


'KNh)  =  pvl;(l+Nh)  +  q\i/(l) 


(3.2) 


Note  that  from  equations  (3.1)  and  3.2)  we  can  reconstruct  Fig.  3.1  and  Fig.  3.2 
(without  using  the  furculas  to  the  left  of  the  equations).  This  goes  to  show  that  the 
fork  equations  for  N  and  retain  the  memory  of  the  underlying  structure.  This 
feature  is  characteristic  of  the  “furcation”  method. 

Equations  (3.1)  and  (3.2)  have  two  unknowns  random  variables,  namely  N  and 
Njj,  and  we  therefore  expect  them  to  be  a  determined  set.  The  r.v.  Nj.,  may  be 
thought  of  as  an  auxiliary  r.v.  but  it  may  be  of  direct  interest  to  some  analysts;  after 
all,  the  problem  of  dividing  the  pool  in  the  midst  of  a  gambling  game  arose  early  in 
the  history  of  probability. 

From  Section  1  we  recognize  (3.2)  as  the  equation  solved  by  =  Gq  or 
\|f(Njj)  *  'k(Gq)  where  Gq  is  a  geometric  r.v.  with  parameter  q.  In  fact,  from  the 
structure  of  the  model  we  can  reason  out  that 

il;(N)  =  i|/(Gp  +  Gq)  and  ilt(Nh)  =  tl/(Gq)  (3.3) 

The  number  of  tosses  leading  to  the  first  h  is  a  Gp,  as  we  well  know;  and  the  number 
of  additional  tosses  leading  to  the  first  t  which  follows  the  first  h  is  a  Gq.  Hence  we 
assert  (3.3).  In  diagram  form,  we  have 


Fig.  3.4 


From  (3.1)  and  3.2)  we  can 

and  generating  functions.  Let  us 
and  3.2)  become 


obtain  for  N  and  for 
start  with  N  and  Nj^ . 


Nf,  their  moments,  distributions 
For  \j/(N)  =  N  equations  (3.1) 
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N  =  p(l+Njj)  +  q(l+N)  and  Nj,  =  p(l+Nj^)  +  q-1 
which  implies 

Nj^  =  1/q  and  N  =  1/p  +  1/q  =  1/pq 
Of  course,  (3.4)  also  follows  from  (3.3). 

Upon  setting  \|/(N)  =  equations  (3.1  and  (3.2)  become 
Ez^  =  pzEz^**  +  qsEzN  and  Ez^**  =  pzEz^**  +  qz 
which  results  in 


Ez^h  =  -BL. 

l-qz 


and  EzN - P3£f _ 

(l-qz)(l-pz) 


where  each  of  the  two  generating  functions  is  explicitly  represented  as  a  rational 
function  of  z  . 

The  probabilities  (Nj)  =  Pr(N=j)  and  (Nh)j  =  Pr(Nj^=j)  are  obtained  by  setting 
t|;(N)  =  Pr(N=j),  etc.  Equations  (3.1)  and  (3.2)  become  then 

Pr(N=j)  =  pPr(l+Nh=j)  +  qPr(l+N=j)  and  Pr(Nh-=j)  =  pPr(l+Nh=j)  +  qPr(l=j) 

from  which  all  (Nj)  and  (Nj,)^  can  be  derived  recursively  starting  with  (Nj,)!- 

Suppose  now  that  we  want  to  solve  (3.1)  and  (3.2)  for  N,  i.e.  to  find  an 
omni-equation  for  N  alone.  From  (3.1)  we  get 

p\|f(l+Nh)  =  i|/(N)  -  q\|/(l+N) 

and,  dividing  throughout  by  p  and  subtracting  1  from  each  argument,  we  have 
♦(N„)  .  yiWN-I)  -  iiMN) 

Substituting  the  last  two  equations  into  (3.2)  we  get  an  uncoupled  equation  for  \|;(N)  in 
{N;ht}: 

\|/(N)  =  \1/(1+N)  -  pq\|/(2+N)  +  pq4f(2)  (3.6) 

Setting  \1/(N)  =  Pr(N=:j)  =  (N)j  in  (3.6)  we  get 


*Ah 

jg* 

1 

(N)j  =  (N)j_i  -pq(N)j-2  +  PqSO-2)  where  (N)j  =  0  if  j  ^  1 


(3.7) 


I 

i 


From  (3.7)  we  have  (N)2  =  pq.  (N)3  =  P^-  (N)4  =  Pq(l-pq).  (N)5  =  pq(l-2pq),  etc. 

Of  course,  in  numerical  work  p  and  q  =  1-p  are  fixed  and  the  recursion  involves  only 
real  numbers,  not  symbols.  Cf.  (3.18b)  for  an  explicit  solution  of  (3.7).  One  can  also 
derive  (3.6)  from  (3.5).  In  fact,  generating  functions  can  be  very  effective  in  such 
uncoupling  of  the  random  variables  entering  omni-equations.  This  is  not  really 
surprising  since  generating  functions  have  been  applied  to  the  solution  of  difference 
equations  of  which  omni-equations  are  a  special  species. 

A  solution  to  a  coin-tossing  model  can  be  given  by  an  explicit  formula  for  the 
probabilities  (or  moments),  by  recursion  formulas  for  the  probabilities,  by  generating 
functions,  or  by  omni-equations.  It  is  often  very  easy  to  switch  from  one 
representation  to  the  other,  especially  among  omni-equations,  generating  function,  and 
recursions  for  probabilities.  The  Table  3.1  below  shows  corresponding  terms  for  the 
three  representations  for  A,  a  r.v. 


Omni-Term 

Generating  Term 

Probability  Term 
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80-1) 

z2 

80-2) 
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80-k) 
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Wa) 
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(G,)j=a(l-a)'-‘ 

v! 

l-(l-a)z 

Table  3.1 

^  k 

r 

Writing  (3.5)  as  a  linear  combination  of  terms 

5 


-  qzEz'^h  =  qz  and  Ez^  -  zEz^  +  pqz^Ez^  =  pqz^ 


(3.5a) 


we  can  transpose  them,  term  by  term,  into  their  omni-form  with  the  aid  of  Table  3.1: 
'KNh)  "  qiKNh+1)  =  q'l^(l)  'l^(N)  -  'KN+l)  +  pq\|;(N+2)  =  pqvl/(2)  (3.8) 


V. 


ym 
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the  second  of  which  is  equivalent  to  (3.6).  Equation  (3./8)  or  (3.5a)  can  likewise  be 
transposed  into  its  equivalent  recursion  in  probabilities; 

(Nh)j  -  q(Nh)j-i  =  qSO-1)  and  (N)j  -  (N)j_i  +  pq(N)j-2  =  Pq80-2)  (3.9) 

the  second  equation  of  (3.9)  being  equivalent  to  (3.7). 

It  is  important  that  we  are  able  to  switch  easily  between  the  omni-form,  the 
generating  function  and  probability  recursions  of  a  model.  We  may  want  to  have  the 
omni-equation  to  calculate  the  moments  or  the  probabilities  modulo  some  integer  but 
we  happen  to  have  the  generating  function  or  a  probability  recursion.  The  literature 
abounds  in  generating  functions  upon  which  we  can  draw.  Moreover,  even  a  skilled 
practitioner  of  omni-equations  may  encounter  a  problem  where  a  probability  recursion 
is  easier  to  derive  than  an  omni-equation.  And  even  in  essentially  omni-work  we  may 
want  to  solve  a  system  of  fork  equations,  often  much  bigger  than  (3.1)  and  (3.2), 
where  generating  functions  may  be  the  method  of  choice. 

We  now  prove  a  theorem  both  by  means  of  omni-equations  and  generating 
functions,  the  latter  aided  by  a  partial  fraction  expansion. 

Theorem  Let  and  be  independent  geometric  random  variables  with 
respective  parameters  a  and  b  where  a  #  b.  Then 


4-(G,  t  q,)  .  .5^  1|.(G,)  +  ^  ) 


a  -  b 


(3.10) 


Equation  (1.6)  states  that 

\l;(Ga)  =  a\lf(l)  +  (l-a)\|/(l+Ga)  (3.11a) 

\lr(Gb)  =  bil;(l)  +  (l-b)tl;(l+Gb)  (3.11b) 

Shift  now  the  first  of  the  above  equations  by  G^  and  a  the  second  by  G^: 

i);(Ga+Gb)  =  aiKUGb)  +  (3.12a) 

tl/(Gb+Gb)  =  biKl+GJ  +  (l-b)^l^(UGb+G,)  (3.12b) 

By  eliminating  the  term  \1;(1+G3+Gb)  from  the  above  two  equations  we  get 

(a-b)tlf(G,+Gb)  =  a(l-b)t(UGb)-b(l-a)tl;(l+G3)  (3.13) 
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From  (3.13)  and  (3.11a)  and  (3.11b)  we  have  (3.10).  Note  that  the  right-hand  side  of 
(3.10)  is  a  linear  but  not  convex  sum  since  either  b-a  or  else  a-b  is  negative.  Thus, 
as  has  been  pointed  out  by  Botta  and  Harris  (1986)  linear  but  nonconvex  sums  of 
random  variables  arise  in  contexts  where  the  original  formulation  of  a  problem  is  in 
the  form  of  sums  and  convex  combinations  of  random  variables. 

It  can  be  shown  by  shifting  (3.10)  by  G,.  that  one  has 


1KG3  +  Gb  +  G,)  = 


J2£. 


(b-a)(c-a) 


4f(GJ  + - sa - ,l,(Gu )  + - ah - ii;(G,) 

(a-b)(c-b)  C'' 


(a-c)(b-c) 


(3.14) 


where  a,  b,  and  c  are  distinct.  (It  should  be  added  that  it  is  possible  to  interpret  G^ 
so  that  a  need  not  be  positive  and  less  than  1  but  we  cannot  delve  here  into  this 
matter.) 

We  derive  now  an  expression  equivalent  to  (3.10),  in  terms  of  generating 
functions.  Since  G^  and  Gj,  are  assumed  independent,  and  since 


Ez^a  = 


JU- 


l-(l-az) 


and 


Ez^b 


J2Z. 


l-(l-bz) 


we  have 


Ez^a+^b  =  Ez^aEz^^b  =  _ az _  bz _ 

l-(l-az)  l-(l-bz) 

With  the  aid  of  the  partial  fraction  expansion  and  its  algebra  we  get 

_ az  •  bz _  _  b  az  a  bz 

[l-(l-az)l[l-(l-bz))  “  b-a  l-(l-a)z  *  a-b  l-(l-bz) 


(3.15) 


(3.16) 


(3.17) 


From  (3.15),  (3.16)  and  (3.17)  we  have 


Ez^a+Ob  ^  _h_  Ez®»  +  -J-  Ez^ 
b-a  a-b 


(3.18) 


which  translates  (3.10)  into  the  language  of  generating  functions. 

Of  course,  (3.10)  yields  (3.16)  upon  setting  \l/(Ga)  =  z  a^  etc.,  but  we  wanted  a 
purely  gf  derivation.  Setting  in  (3.10)  i^Gj)  =  Pr(G3*j)  etc.,  we  get 
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Pr(0,  ♦  Ob  .  j)  .  ^  Pr(G,  »  i)  +  -^  PfCOb  -  i) 
b-a  a-D 

=  _L_  (1-Ji-^a  +  ^(l-b)i-ib 

b-a  **  a-b 


Setting  a  =  p  and  b  =  q  in  (3.18a)  we  have  for  the  {N;ht},  in  view  of  (3.3), 


(3.18a) 


N.  =  -3_  qj-ip  +  p  j-iq 
J  q-p  ^  p-q  ^  ^ 

an  explicit  expression  for  Nj. 


(3.18b) 


We  return  now  to  the  project  {hh}.  Although  the  model  {N;hh}  can  be 
represented  by  a  single  finite  renewal  tree  the  model  {r;hh}  (where  r  «  number  of 
reversals  h  -►  t  and  t  -» h  in  the  project  {hh})  cannot  be  so  represented.  We  will  apply 
the  general  forking  method  to  analyze  it.  The  first  toss  is  described  by  the 
omni^equation 


:r_  t  '!'(>■)  =  P^(rh)  + 

where  rjj  «  number  of  remaining  reversals  if  the  first  toss  is  h 
tj  =  number  of  remaining  reversals  if  the  first  toss  is  t. 


The  fork-equations  conditioned  upon  having  just  tossed  h,  or  else  just  tossed  t,  are 


(3.19) 


'lf(rh)  =  P'KO)  + 


t  ^(ft)  =  Pv^l+fh)  + 


(3.20) 


(3.21) 


The  three  equations  (3.19),  (3.20)  and  (3.21)  allow  us  to  derive  the  distributions  of  r, 
or  rj,  and  of  r^;  their  moments  and  their  generating  functions;  and  other  functionals. 
Note  that 

R  a  r+1,  Rjj  a  r^  and  Rj  = 

where  R  a  number  of  runs  in  any  sequence;  R^  and  are  the  corresponding 
conditional  random  variables. 


SJ*  * 

w 


7tjC''y*y“y’y!  yy^y^y'/yy’  yyyyyjy.' 


y:'y:> 


Si 


! 

i 


6 


I 


r; 


r^ 


,  / 


hi* 

■jf 

a 

r  * 


It  is  easy  to  verify  that 

r  =  (2-p2)/p,  r  =  2q/p  and  r  ^  =  (l+q)/p 
When  p  =  q  =  1/2  we  get  =  2.5,  r  =  2,  and  r  j=  3. 


(3.21a) 


We  derive  the  generating  functions  for  r,  for  r^  and  for  r^  by  setting  \|;(r)  =Ez^, 
etc.  After  a  little  algebra  we  get 


Ez^  =  Ez"h=_P_;  Ez^  =  -H1- 

l-qz®  l-qz2  1-qz 


(3.22) 


Aided  by  Table  3.1  we  recycle  these  gfs  into  uncoupled  omni-equations  for  r,  for  r^ 
and  for  r. : 


\l;(r)  =  p>(0)  +  pq\lf(l)  +  qT|f(r+2) 

'k(rh)  =  P^KO)  +  q^(2+rh) 

Wi)  =  P'lf(l)  +  qiK2+rt) 

From  Table  3.1,  or  by  setting  \l/(r)  =  Pr(r=j)  =  (r).  in  (3.23) 
we  get 

(r)j  =  P^80)  +  pq80-l)  +  q(r)j.2 

and  successively 

(r)o=P^  (r)i=Pq  (r)2=qp^  (r)3=pq^  (r)2j=q^P^  (r)2j+i  =  pq 

We  find  a=Pr(r  is  odd)  by  setting  \|f(r)  =  Pr(r  is  odd)  in  (3.23): 

Of  =  pq  +  qof,  hence  a  =  q 


(3.23) 

(3.24) 

(3.25) 


(3.26) 


2j+l 


(3.27) 


A  major  advantage  of  the  omni-method  is  its  ease  in  formulating  multivariate 
problems.  Suppose  that  we  are  interested  in  jointly  treating  N  and  r  in  the  project 
(hh);  we  designate  the  model  of  interest  as  {N,r;hh}.  Let  us  proceed  in  developing  the 
elementary  forks  (the  furculae)  and  their  associated  omni-equations.  The  first  toss 
results  in 
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\|;(N,r)  =  p\k(l+Nh,rh)+q'Kl+Nt,rt) 


(3.28) 


Unlike  in  the  model  {N;hh}  we  now  distinguish  between  ij/CNj)  and  \lf(N)  because 
of  the  interdependence  between  Nj  and  r^,  The  fork-equations  conditional  upon  the 
prior  toss  having  been  a  non-stopping  h,  or  else  t,  are 

Wh*  '‘h)  =  P'Kl-O)  +  q'Kl+Nt.l+rJ  (3.29) 

iKNt.rt)  =  P'Kl+Nh.l+rh)  +  q\Kl+Nt  ,1+rj)  (3.30) 

From  the  above  equations  we  can  find  the  joint  (and  marginal)  distributions  for  N  and 
r;  for  Njj  and  rjj;  and  for  Nj  and  r^;  we  can  find  recursively  all  the  mixed  moments 
for  the  pairs  just  indicated;  we  can  find  the  bivariate  gf’s  for  these  pairs;  and  we  can 
find  some  other  bivariate  functionals. 

At  this  point  it  is  worth  indicating  how  to  combine  N  and  r  (and  in  other 
problems  any  number  of  random  variables)  into  a  random  vector.  Define 

V  =  (N.r),  Vh  .  (Nh.fh)  and  V,  .  (N.,r,) 
oij  =■  (i.i) 

The  fork-equations  (3.28),  (3.29)  and  (3.30)  can  be  extended,  using  this  vector 
notation,  to 


\|/(V)  »  p\|/(o,o+Vh)  +  q\l/(Oio+Vj) 

(3.28a) 

'KVh  =  P'Koio)  +  qvKai,  +  Vj) 

(3.29a) 

Wt)  =  P^(a,,)  +  qiKoio  +  Vj) 

(3.30a) 

The  above  form  should  be  somewhat  more  comfortable  for  uncoupling  the  set  (3.28), 
(3.29)  and  (3.30)  in  a  way  analogous  to  deriving  (3.8)  from  (3.1)  and  (3.2)  via  (3.5a) 
for  the  model  {N;ht}. 

Many  authors  on  probability  consider  that  their  task  is  completed  upon  deriving  a 
generating  function  or  a  set  of  such  functions  (or  the  kindred  Laplace  transforms). 

We  claim  the  same  privilege  at  this  time. 


The  flexibility  of  the  omni-method  and  of  the  fork-equations  enables  us  to  replace  or 
supplement  the  process  N,  number  of  tosses,  by  the  process  K,  the  cost  of  tossing. 

We  can  think  of  N  as  the  cost  when  each  toss  costs  exactly  1  unit.  Suppose  now  that 
the  cost  of  a  toss  depends  on  whether  it  is  h  -» t  or  t  t,  etc.  E.g.  consider  the 
cost-matrix  of  Table  3.2: 


Table  3.2 


More  complex  cost  matrices  can  take  more  tossing  history  into  account.  (Such  costing 
patterns  were  considered  in  some  of  our  work  on  Markov  chains,  to  be  published  in 
the  future.) 

Although  the  model  {N;hh}  is  described  by  a  single  finite  renewal  tree  the  model 
{K;hh},  with  the  K-matrix  in  Table  3.2  cannot  be  reduced  to  such  a  tree.  We  need 
several  fork  equations.  It  is  not  necessary  to  know  in  advance  the  number  of 
elementary  forks,  these  forks  arise  in  the  process  of  developing  the  equations.  In  our 
case  we  have  for  the  cost  K 

ilt(K)  =  piKKh)  +  qik(K,)  (3.31) 

rl/(Kh)  =  pW)  +  qWt)  (3.32) 

il/(K  J  -  pi(-2+Kh)  +  q^l+Kj)  (3.33) 

Setting  \1;(K)  =  K,  etc.,  we  get 

K  s  pKjj+qK^;  Kj^  =  p+qKp  =  -2p+pK^+q+qKj 


from  which  we  find  when  p  =  q  =  1/2 


Section  4  On  Converting  Omni-Equations  in  N  into  Omni-Equations  in  H  and  T 

It  seems  to  have  escaped  general  notice  that  the  equations  for  N,  number  of 
tosses  in  a  project,  carry  enough  information  to  be  transformed  into  bivariate  equations 
in  H  and  T,  the  number  of  h-tosses  and  the  number  of  t-tosses.  This  is  true  whether 
the  N-equations  are  in  the  omni-form,  g.f.  form  or  in  distribution  form.  However,  it 
is  easier  to  observe  this  convertibility  from  N  to  (H,T)  for  omni-equations. 

For  didactic  purposes  let  us  start  with  (2.9)  alias  (4.1): 

^(H,T)  =  q^(H,l+T)  +  pq4f(UH,l+T)  +  p"tl;(2,0)  (2.9)  =  (4.1) 

Specializing  the  general  function  \1/(H,T)  to  \Jf(H+T)  =  \)/(N)  we  have 

ilt(N)  =  qi|;(l+N)  +  pq\l/(2+N)  +  pV(2)  (2.1)  =  (4.2) 

The  question  suggests  itself  now.  How  do  we  rederive  (4.1)  from  (4.2)  if  this  is  at  all 
possible?  For  generally,  operations  which  mingle  variables  tend  to  obliterate 
information;  and  the  passage  from  (4.1)  to  (4.2)  comingles  H  and  T  by  sununing  the 
two  into  N.  But  a  close  comparison  of  (4.1)  and  (4.2)  reveals  that  in  our  case  there  is 
enough  special  structure  in  (4.2)  to  allow  us  to  derive  from  it  (4.1).  Compare  the  first 
right-hand  term  of  (4.1)  with  its  homologue  in  (4.2);  we  have 

q\|i(H,l+T)  and  q\|f(l+N) 

Each  of  these  terms  has  the  coefficient  q,  a  reminder  that  the  last  toss  was  a  t-toss. 
The  summand  1  in  the  argument  (1+N)  tells  that  one  toss  has  taken  place;  and 
summand  1  in  the  bivariate  argument  (H,l-i-T)  tells  that  a  t-toss  has  taken  place. 

Thus,  the  term  qiJt(H,  1+T)  says  twice  that  the  toss  was  “tails”  whereas  the  term 
q\lt(l+N)  says  it  only  once.  But  one  hint  is  enough  for  the  wise,  as  the  saying  goes, 
and  thus  we  know  that 

qilf(l+N)  ~  q\KH,l+T)  (4.3) 

if  indeed  (4.2)  can  be  transformed  into  (4.1). 

Let  us  now  compare 
pqi|/(l+H,l+T)  with  pq\|/(2+N). 

The  coefficient  pq  in  each  of  the  two  terms  bears  witness  to  the  fact  that  one  h-toss 
and  one  t-toss  took  place.  The  argument  (2+N)  confirms  that  altogether  two  tosses 


took  place;  the  bivariate  argument  (1+H,1+T)  specifies  again  that  an  h-toss  and  a 
t-toss  took  place.  Therefore  the  term  pq\|/(2+N)  carries  enough  information  for  its 
conversion; 

pq\J;(2+N)  ~  q\|/(l+H,l+T)  (4.4) 

The  terms  p^\|;(2,0)  in  (4.1)  and  p^v(f(2)  in  (4.2)  each  says  through  its  coefficient,  that 

the  toss  couple  hh  took  place.  The  argument  in  p  i|;(2)  confirms  the  number  of  tosses, 
while  the  argument  in  \|r(2,0)  specifies  that  hh  occurred.  Thus 

p=l|/(2)  ~  pV(2,0)  (4.5) 

Thus  we  have  transformed  (4.2)  into  (4.1).  A  similar  argument  allows  us  to  transform 
any  linear  omni-equation  in  N  into  an  equation  in  (H,T)  provided  that  in  the 

N-equation  each  term  is  of  the  form  p^q^i|/(a+b+N)  or  p“q*’\|;(a+b);  for  such  terms  the 
correspondence  with  (H,T)  terms  is 

p®q*’)tf(a+b+N)  ~  p*q^  \|f(a+H,b+T)  and  p^q*’ \|/(a+b)  ~  p^q*’\l;(a,b)  (4.6) 

We  will  call  such  terms  in  N  or  in  (H,T)  upright.  A  linear  omni-equation  shall  be 
called  upright  if  each  of  its  terms  is  upright.  Thus,  (4.2)  and  (4.1)  are  upright. 

It  is  easy  to  see  that  each  model  for  N  can  be  formulated  as  an  upright 
omni-equation.  This  is  clearly  seen  when  one  starts  with  a  renewal  tree  (it  is  even 
clear  for  an  infinite  tree  such  as  Fig.  1.1)  and  it  is  clear  that  each  nascent  fork 
equation  is  also  upright.  For  the  general  form  of  such  a  fork-equation  is 


t  t(A)  »  p\lt(l+Ah)  +  q\|f(l+At)  (4.7) 

q 

where  the  r.v.  A  is  N  or  or  Nj  or  or  ^  or  another  conditioned  N.  (If  A  = 
Njjj  then  Aj^  =  etc.)  One  can  also  see  that  when  uncoupling  a  set  of  such 

forks  into  a  set  of  univariate  equations  (one  equation  for  N,  another  for  Nj^,  etc.)  one 
can  preserve  the  uprightness  by  careful  algebra. 

But  what  about  those  omni-equations  for  N  which  contain  non-upright  terms? 
For  example,  the  omni-equation  (3.6)  for  (N;ht} 


\|;(N)  =  \j;(l+N)  -  pq\J;(2+N)  +  pq'4/(2) 


(3.6)=  (4.8) 
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contains  a  term  which  is  not  upright,  namely  \j;(l+N).  Without  disturbing  the  upright 
terms  we  multiply  \|f(l+N)  by  p+q  in  (4.8)  thus  obtaining 

\J;(N)  =  p\l;(l+N)  +  q\|;(l+N)  -pq\lr(2+N)  +  pq\|;(2)  (4.9) 

which  is  an  upright  equation.  Converting  (4.9)  term  by  term  we  apply  the  transitions 
i|/(N)  -  i(H.T) 
pil/(l+N)  pvl/(l+H,T) 
qiJ^(l+N)  -  q4f(H.l+T) 

-pq\jf(2+N)  -pqvl/(l+H,l+T) 
pq  (2)  -  pq\l/(l.l) 

and  obtain  an  upright  omni-equation  in  H,T  for  the  model  {N;ht} 

The  polynomial  p+q  is  the  only  polynomial  in  p  and  q  which  is  of  degree  one  and 
which  equals  one.  Likewise,  to  make  the  term  \J;(3+N)  upright  we  multiply  it  by 

(p+q)  .  (The  omni-equations  in  N  where  each  term  applies  to  a  single  path  are  bom 
upright;  but  where  the  terms  corresponding  to  some  different  paths  but  with  identical 
arguments  are  combined  the  uprightness  may  be  lost.) 

The  notion  of  uprightness  applies  also  to  gf  terms  and  equations.  We  say  that  a 
gf-term  for  N  is  upright  if  it  has  the  form 


(pz)®EzN 

and  that  a  gf-term  for  H,T  is  upright  if  it  has  the  form 
(pz,)*(qzj)*’EzH  •  EzJ. 


(4.10) 


(4.11) 


It  is  easy  to  see  that  upright  omni-terms  and  upright  gf-terms  correspond  to  each 
other;  this  holds  for  terms  in  N  and  for  terms  in  H,T. 

The  following  diagram  illustrates  the  relation  between  upright  omni-terms  and 
gf-terms;  and  between  N-terms  and  H,T-terms: 


p*q*’  \j/(a+b+N) 


a  b  a+fab  N 
p  q  z  ^z 


p  ^  q®  \jr(a+H,b+t) 


(pzj)  Fig.  4.1 
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The  above  diagram  remains  valid  when  we  set  N  =  0;  H  =  0;  and  T  =  0. 

[We  could  have  defined  an  upright  gf-term  as  a  result  of  setting  \1;(N)  =  Ez^  or 
setting  aKH.T)  =  Ez”  •  Ez^  in  upright  omni-terms.] 

We  can  likewise  define  uprightness  for  linear  terms  in  recursive  relations  for 
probability  densities  by  specializing  \j/(N)  =  Pr(N=j)  and  Pr(H=i,T=j)  =  Njj  in 

omni-terms.  Denoting  as  before  (N)j  =  Pr(N=j),  and  (N)ij  =  Pr(H=i,T=j)  we  see  that 
upright  terms  for  probability  densities  are 


and 


and 

p^q^Ni-aj-b  ; 

PN^Sj-a-b 

and 

P^q^Si-a  8i-b  ■ 
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Example  from  Feller  Consider  the  model  {N;k*h}  where  k*h  stands  for  a  string  of  k 
“heads.”  Feller,  p.323,  equation  (7.6),  gives  two  equivalent  expressions  for  the  g.f.  of 
N: 


EzN  * 


pkzk 


and 


EzN  = 


1  -  qz(l+pz  ++  pk-i  zk-i). 


pkzk  (-pz) 

1  -  z  +  qpk  zk+i 


(4.13a) 


(4.13b) 


We  notice  that  (4.13a)  is  upright,  i.e.  each  of  its  terms  is  upright  whereas  (4.13b)  has 
one  term  which  is  not  upright,  namely  -z  in  the  denominator.  Not  surprisingly, 
(4.13b)  is  a  result  of  summing  the  (finite)  geometric  series  in  the  denominator  of 
(4.13a)  and  is  useful  for  large  k. 

The  conversion  of  (4.13a)  into  the  gf  for  the  model  {H,T;k*h}  is  straightforward, 
and  we  show  the  result  only. 


EzH 


1  -  qz2(l+pz^  ++  pk-lZjk-l) 


(4.14a) 


In  order  to  convert  (4.13b)  into  the  gf  in  H,T  we  first  make  it  upright  by  replacing 
the  term  -z  in  the  denominator  by  -(p+q)z,  a  very  gentle  intervention: 
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I  f  gJL.  iJ^  WC 


pk  z  k  _  pk+1  zk+1 

Ez^  =  - 

1  -  pz-qz  +  qp*^  zk+i 


(4.13c) 


It  is  important  to  note  that  in  order  to  transpose  (4.13a)  and  (4.13b)  which  are 
N-equations,  into  (4.14a)  and  (4.14b)  below  which  are  (H,T)-equations  we  need  not 
know  the  target  of  the  project,  here  k*h;  we  do  have  to  know  that  we  start  with 
N-equations. 


From  (4.13c)  we  derive  the  corresponding  gf  in  H,T  (Fig.  4.1  may  be  used  for 
term  correspondences)  and  obtain 


Ez^  •  z 


T 

2 


pk  2  k  _  pk+I^k+l 

1  -  pZi  -  qz2  +  (pz^)kqz2 


(4.14b) 


From  (4.14b)  we  now  derive  the  omni-equation  in  H,T.  We  obtain,  by  first  clearing 
the  fraction,  converting  each  term  as  indicated  in  Fig.  4.1 

4/(H,T)  =  pilf(l+H,T)  +  qvlr(H,l+T)  -  p‘'qilr(k+H,l+T) 

+  p'^\lf(k.O)  -  p‘‘'^^\l/(k+l,0)  (4.15) 


Equation  (4.15)  can  be  converted  into  a  recursion  equation  for  Nj.  =  Pr(H=i,T=j)  by 
setting  \|f(H,T)=Pr(H=i,T=j);  then 

Nij  =  pNi-i.j  +  qNi,j-i  -  p'^qNi-k.j-i 

+  p‘^8(i-k)8a)  -  p*‘^^80-k-l)80)  .  (4.16) 


We  will  not  delve  into  the  intricacies  of  how  to  evaluate  numerically  the  Njj  or  the 

mixed  moments  of  H  and  T.  Ou  '  aim  was  to  show  how  to  transform  an  equation  in  N 
into  an  equation  in  T,H.  One  sir.  pie  result  deserves  being  mentioned.  From  (4.15) 
we  extract  the  omni-equation  for  T  alone  by  letting  t1;(H,T)  -►  \1;(T);  after  some 
simplification  we  have 

r|r(T)  =  (l-p‘‘)rlr(l+T)  +  pS(0)  (4.17) 

and  we  recognize  that  \jr(T)  =  t^Gp'  -  1)  where  p'  =  p*^;  cf.  (1.6). 

From  (4.15)  we  get  the  separate  omni-equations  for  \|;(H)  and  \);(T)  by  specializing 
rlr(H,T)  ^  \lr(H)  and  v|/(H,T)  ^  vl;(T): 


^  .  •.  V*  •  K~  ». 


<|;(H)  =  pi|i(l+H)  +  q\li(H)  -p'‘q\KktH)  t  p^iKk)  -  p''*'iMktl) 

IMT)  =  piltOO  t  qiMl+T)  -p'‘q4r(ltT)  t  p^^iCO)  -  p^^'+CO) 
which  simplify  to 

\1;(H)  =  \1;(1+H)  -  p'^'^qvlrCk+H)  +  p'^'VCk)  -  p'‘\l/(k+l) 

v(;(T)  =  (l-p‘‘)ilf(l+T)  +  p^m 
Now,  from  either  (4.13a)  or  (4.15)  we  have,  since  H+T=N, 

\|;(N)  =  \|/(1+N)  -  qp^\|f(l+k+N)  +  p^\if(k)  -  p*^'^^\if(k-»  ) 

From  (4.17a)  we  recognize  that,  with  p'  =  p'^, 
tl/(T)  =  ;lr(Gp' -1);  cf.  (1.6) 

Suppose  now  that  we  are  interested  in  the  r.v.  H  -  T.  By  specializing 
\1/(H,T)  -»  \|/(H-T)  and  denoting  H  -  T  =  L  we  get  from  (4.15) 

iJ/(L)  =  p\|/(l+L)  +  q\lt(-l+L)  -  p‘^q\|/(k-l+L)  +  p'^rl/(k)  -  p'^'^^\l;(k+l) 


(4.15a) 

(4.17a) 


(4.18) 


(4.19) 


(4.20) 


We  obtain  the  moments  of  L  from  (4.20)  like  from  any  other  linear  omni-equation. 
Thus  we  get  by  setting  \|f(L)  =  L. 


L=  p^rj.  .,1 

Q  pk 


(4.21) 


There  is  no  gf  of  L  corresponding  to  (4.20)  since  L  is  not  non-negative.  A  result  like 
(4.21)  would  have  to  be  derived  from  the  bivariate  gf  for  H  and  T,  i.e.  (4.14b),  or 
from  the  joint  distribution  of  H  and  T.  The  recursion  equation  for  the  distribution  of  L 
is  obtained  from  (4.20)  by  setting  \1/(L)  =  Pr(L=j)  =  (L)j: 


Lj  =  P(L)j_i  +  q(L)j+i  -  p‘^q(L)j+i_k  +  p^^SO-k)  -  p*^'^^8(j-l-k) 


(4.22) 


It  can  be  shown  that  for  a  given  k  one  can,  starting  with  k-1  consecutive  values  of 
known  (L)-  (e.g.  (L)q,  (L),  ...(L)j^_^),  derive  the  other  values.  And  it  is  easy  to  see 

that  for  k=l  we  have  L=l;  and  that  for  k=2  we  have  L=2;  and  that  for  k=3  there  is 
neither  a  lower  nor  an  upper  bound  on  (L)j. 


In  order  to  complete  honorably  the  Feller  example  we  derive  for  N  its 
omni-equation  and  its  distribution.  From  Fig.  4.2  below  we  read  (4.23) 


toss#k 

h 


h - 1 


y 

y 
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Fig.  4.2 


\lf(N)  =  q\i;(l+N)  +  pq\lr(2+N)  +  p2qvlf(3-N)  ++  p*‘"^q\j/(k+N)  +  p‘‘\l;(k)  (4.23) 

Setting  \|f(N)  =  results  in  (4.13a). 

Shifting  each  term  of  (4.23)  by  1  and  multiplying  it  by  p  results  in 

pijr(l+N)  =  pq\l/(2+N)  +  p^q\l;(3+N)  +  p^q\|;(4+N)  ++  p‘^q\j;(l+k+N)  +  p*^'^^\j;(l+k)  (4.24) 


A  comparison  of  (4.23)  and  (4.24)  results  in 

4f(N)  «  pij/(l+N)  +  q\|f(l+N)  +  p*'q\lr(l+k+N)  +  p^\l/(k)  -  p*‘'^V(l+k) 


Setting  \1;(N)  =  z^  produces  (4.13b). 


(4.25) 
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Section  5  Olio 

In  this  section  we  analyze  several  examples  which  are  of  interest  because  they  show 
the  scope  of  the  omni-method  or  because  of  some  methodological  aspects  not 
previously  elaborated. 

Example  1  Toss  a  coin  till  at  least  2  out  of  3  consecutive  tosses  are  h;  find  N. 

(M  [h] 


Fig.  5.1 


Note  that  in  the  path  h-t-t  in  Fig.  5.1  the  h  is  not  “operative”  because  it  gets  lost 
upon  the  very  nearest  toss.  From  the  tree  we  read  directly 

'KN)  =  p^ilr(2)  +  p^q4f(3)  +  q\l/(l+N)  +  pq^\J;(3+N)  (5 

and  from  (5.1)  we  have 

Nj  .  p'8(j-2)  *  p*q8a-3)  qN,.j  +  pq^Nj-Sj.,  (5 

Nj  =  p  ;  Nj  =  p  q,  N4  =  p  q  ;  Nj  =p  q  ;  Nj  =  p  q  +  p  q  etc., 


ill(H,T)  =  p'>|t(2,0)  t  pV(2.1)  +  q'WH.ltT)  t  pqV(ltH,2tT)  (5.3) 

Nij  -  p'8(i-2)8(j)  +  p’qN,,  t  pq^Ny-i  ,  pq'Ni-j.j-2 

When  we  toss  till  at  least  2  our  of  k  consecutive  tosses  show  h  we  have  Fig.  5.2. 


[h]  [hi 


[h]  k-1  heads 


.t - 1- - (t)  1  heads  and  k-1  tails 


Fig.  5.2 


We  read  off  the  diagram  that 

\|;(N)  =  p^\|;(2)  +  p^q\lf(3)  ++  p^q'^'^ilrCk)  +  pq‘^~V(k+N)  +  qv^l+N)  (5.4) 

The  reader  should  be  alerted  to  the  fact  that  the  problem  “toss  till  you  get  j  heads 
among  k  successive  tosses  and  count  N”  is  generally  more  complex  combinatorially 
and  cannot  be  represented  by  a  single  finite  renewal  tree;  even  if  j  =  3  and  k  =  5  the 
combinatorics  call  for  the  general  furkation  method  or  a  different  approach  altogether. 

The  reader  should  also  verify  that  the  renewal  tosses  in  Fig.  5.1  and  in  Fig.  5.2 
cease  to  be  renewal  tosses  for  the  r.v  r  (number  of  changes  from  h  to  t  and  from  t  to 
h).  The  general  furkation  method  is  needed  to  analyze  the  project  mentioned. 

Example  2  Model  {N;ht}  Revisited 

We  have  dealt  with  this  model  in  Section  3  already.  Now  we  derive  the  recursion  for 
the  probabilities  (from  which  we  can  get  the  onmi-equation)  by  first  finding  a  few 
initial  probabilities.  Suppose  that  we  can  justify  the  following  form  for  that  recursion 

(N)j  =  pq80-2)  +  c,(N)j_i  +  (5-5) 

We  can  evaluate  Cj  and  Cj  if  we  figure  out  directly  (N)};  we  know  that  (N)2  *  pq.  the 
siseediest  realization  of  ht.  It  is  easy  to  see  that  (Njj  =  (N)2  since  the  triplets  which 
lead  to  N  =  3  are  hht  and  tht;  these  two  jointly  have  the  same  probability  as  ht.  From 
(5.5)  we  now  have 

(N)3  =  Ci(N)2  +  C2(N)i  «  c,(N)2  since  (N)j  =  0 

But  we  know  that  (N)j  =  (N)2  and  therefore  Cj  =  1.  And  since  in  (5.5)  pq  +  Cj  +  C2 
=  1  (this  we  know  to  be  true  since  (5.5)  is  transformable  into  an  omniequation  with 
identical  coefficients)  we  have  C2  =  -pq.  This  agrees  with  (3.7).  If  we  do  not  know 
the  number  of  coefficients  in  (5.5)  (these  are  cq  +  ci  and  C2)  we  can  keep 
computing  them  until  Cg  +  Cj  ++  Cj^  =1.  But  even  with  this  procedure  it  is  thinkable 
that,  e.g.  =  1.  Thus  the  method  is  incomplete  so  far,  it  is  a  plausibility. 
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Let  US  consider  now  the  model  {N;hth}.  We  assume  that  the  form  of  the 
probability  recursion  is 

(N)j  =  P^q80-3)  +  c,(N)j_i  +  CjChOj-a  +  CjChOj.j  (5.6) 

a  plausible  assumption  in  view  of  our  experience  with  omni-equations. 

We  calculate  successively  (directly  from  the  combinatorics  of  the  model,  not  from 
(5.6)) 

(N)3  -  p"q  ;  (N)4  =  (N)3  :  (N)5  =  (l-Pq)(N)3  and  (N)6  *  (1-pq  -  p'q)(N)3  (5.7) 

From  (5.6)  and  (5.7)  we  have 
(N)4  =  Cj(N)3  ;  hence  c,  =  1 
From  (5.6)  and  (5.7)  we  further  have 


I 

s 


'y'' 


(N)5  =  c,(N)4  +  C2(N)3  =  C2(N)3  =  (l-pq)(N)3  ;  therefore  C2  =  -pq 
Equation  (5.6)  can  now  be  written 

(N)j  »  P^q80-3)  +  (N)j_i  -  pq(N>j.2  +  C3(N)j.3 

Since  the  coefficients  on  the  right-hand  side  add  up  to  one  we  have  C3  =  pq  ;  (5.6)  is 
now 

(N)j  =  pq  0-3)  +  (N)j_i  -  pq(N)j-2  +  pq^(N)j-3  (5-8) 

We  can  compute  (hOg  from  (5.8)  and  verify  that  it  agrees  with  the  value  given  in 
(5.7). 

The  omni-equation  corresponding  to  (5.8)  is  (cf.  Table  3.1) 

\1;(N)  =  pq\|/(3)  +  iKl-N)  -  pq\l/(2+N)  +  pq^\]/(3+N)  (5.9) 

Equation  (5.9)  can  be  transposed  into  an  equation  in  H,T. 

Example  3  Consider  the  model  {N;  h.odd  #t,h}.  The  fork  equations  for  this  model  are 


h 

t 


il/(N)  *  p\|;(l+Nh)  +  q\{i(l+N).  since  \l/(Nj)  =  v|/(N) 


(5.10) 
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t  tl;(Nh)  =  p^|/(l+Nh)  +  qx|r(l+Nh,).  since  T|;(Nhh)  = 


(5.11) 


\|r(Nht)  =  P^kCl)  +  q'Kl+Nh).  since  v|/(Nht  t)  =  'K^h)  (512) 


Only  three  fork  equations  suffice  for  a  problem  which  at  first  appears  combinatorially 
complex.  From  the  above  equations  we  easily  get 


+ 1 ,  N.  =  JL  +J. .  R.  =a 


pq  p  ’ 


For  p  a  q  =  1/2  (5.13)  becomes  N  =  8;  Nj^  =  6  ;  =4. 


From  (5.10),  (5.11)  and  (5.12)  we  can  derive  all  higher  moments  and  the  probabilities 
for  N,  for  Njj  for  and  for  Nj  jj.  We  can  also  derive  the  gf s  for  N,  for  Nj^,  for  N 

and  for  by  setting  in  those  equations  \{f(N)  =  z^,  etc.  We  thus  get 


Ez^  a  pzEz^b  +  qzEz^ 


Ez^h  =  pzEz^h  +  qzEz'^t 


Ez^t  *  pz  +  qzEz^h 


from  which  we  get 


EzN _ P^qz3 

1  -  z+q(p-q)z2  +  q3z3 


(5.14) 


Ez^h 


pqz2 

1  -  pz  -  q2z2 


(5.15) 


g^Nht ,  _P£(1iE!L_ 
1  -  pz  -  q2z2 


(5.16) 


The  last  three  equations  can  be  easily  transposed  into  omni-equations  (c.f.  Table  3.1) 


\1;(N)  -  \1/(1+N)  -  q(p-q)4f(2+N)  -  q^\l;(3+N)  +  p^q\l/(3) 


(5.14J 


(5.15a) 


^KNh)  =  ^IfCl+Nh)  +  pvk(l+Nh)  +  q  ^(2+Nh)  +  pqvl/(2) 
Wht)  =  P'Kl+Nht)  +  q'^(2+Nht)  +  P^(l)  -  p'iK2) 


(5.16a) 


The  three  above  equations,  or  the  equations  (5.14),  (5.15)  and  (5.16)  can  be 
transposed  into  the  corresponding  equations  for  the  probabilities.  The  reader 
interested  in  converting  (5.14)  or  (5.14a)  into  an  equation  for  (H,T)  should  make  it 
upright.  A  neat  exercise  is  also  to  derive  (5.15)  by  assuming  (and  plausibly  justifying) 
that 

\|/(N)  =  0£o\1/(3)  +  aj\lf(l+N)  +  a2\|t(2+N)  +  cii3\if(3+N)  (5.17) 

and  deriving  the  aj  from  several  initial  values  (reasoned  out  independently)  (N)j. 
(Transposing  (5.17)  into  a  probability  equation  may  ease  the  exercise.) 

Example  4  Model  {N;h,k*t,h} 

In  this  model  k  is  a  fixed  positive  integer.  It  is  not  difficult  to  reason  out  that 

(^k+2  *  P  q*^  =  (^k+3  “  ~  (^2k+2  (5.18) 

We  assume  that 

(N)j  =  p^q‘^80-k-2)  +  ai(N)j-i  +  a2(N)j-2  ++  ak+2(N)j-k-2  (5.19) 

Thus,  for  k  =  1  we  have  (hOs  =  P^q  =  (N)4  and  (N)j  =  p^q8(j-3)  +  CK,(N)j_i  + 
a2(N)j_2  +  «3  (N)j-3-  This  equation  is  plausible  since  there  are,  as  is  easy  to  check, 
three  fork  equations  when  k=l. 

It  is  further  easy  to  see  that  ttj  =  1  and  (Xj  =  =  °^k  =  ®  when  k  ^  2  by 
comparing  the  (N)j  from  (5.18)  with  the  (N)j  computed  from  (5.19)  for 
j  =  k+2,  k+3 . 2k+2.  We  further  find  that 


(N)2k+3  =  (i-pq‘‘)(N)k+2  and  (N)2k+4  =  (i-pq‘'-p^qk)(N)k;2 
We  get  eventually 

(N).  =  p'q>‘8a-k-2)  +  (N)j_i  -  pq*^  (N)j-k-i +pq‘^*nN)j-k-2 


(5.20) 


(5.21) 
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which  specifies  the  ot^  in  (5.19).  We  note  that  the  coefficients  on  the  righthand  side 
sum  to  1,  as  they  should.  The  requirement  that  the  coefficients  on  the  left-hand  side 
and  right-hand  side  sum  to  the  same  value  follows  from  the  term-by-term 
correspondence  with  the  omni-equation 

\lf(N)=p^q*‘^(k+2)  +\1;(1+N))  -  pq‘‘\l;(k+l+N)  +  pq*‘'^^\l;(k+2+N) 


(5.22) 


by  setting  i|;(*)  =  1. 


Example  5  Model  {N;  k*h,  l*t}  We  state  the  following  equation  for  the  model 
{N:k*h,l*t}  without  derivation: 


(N)j  =  p'^q^  80-k-/)  +  (N)j_i  -  p'"q‘  (N)j-k-l 
from  which  follow  (cf  Table  3.1) 

iJ/(N)  =  p^^q^  'Kk+O  +  \lr(l+N)  -  p'^q^  \l;(k+/+N) 
and 


(5.23) 


(5.24) 


EzN 


pkq/ 


(5.25) 


1  -  2  +  pk  q/  z  k+/ 

When  k=/=l  then  (5.23)  is  the  same  as  (3.7)  and  our  model  is  the  familiar  {N;ht}. 
Note:  Consider  a  string  o  of  h’s  and  t’s  in  some  order.  Assume  that  in  this  string  no 


toss,  except  for  the  last  one,  can  be  a  stopping  toss  whatever  sequence  of  h’s  and  t’s 
would  precede  o.  Then  (5.23),  (5.24)  and  (5.2-5)  are  valid  for  {N;  o}  with  k=#  heads 
in  the  string  o  and  1=#  tails  in  o. 


Example  6  Target  {hh  or  tt) 

Feller,  p.  327,  (8.2)  derives  a  gf  for  the  number  of  tosses,  N,  to  reach  a  string  of 
r  heads  (event  6j)  or  a  string  of  p  tails  (event  Cj)  whichever  comes  first.  He  also, 
(8.6),  derives  gf’s  for  the  probabilities  of  “reaching  €5  in  n  tosses  without  hitting  Cj 
first”  and  of  “reaching  Cj  in  n  tosses  without  hitting  €j  first.” 

We  develop  a  set  of  fork-equations  for  the  random  vector  (N,a)  where  a  =  1,  or 
else  2,  if  the  tossing  results  in  event  Cj,  or  else  in,  event  Cj;  the  random  variables  a^j 
and  aj  are  conditioned  upon  having  just  tossed  h  or  else  t.  Our  example  is  modest  in 


42 


1 


that  we  select  tiny  target  strings  but  the  method  is  indicated  clearly.  We  add  that  the 
forking  method  is  applicable  to  more  than  two  alternative  events  which,  moreover,  may 
be  strings  omer  than  heads  only  or  tails  only.  We  tan  choose,  e.g. 

€j  =  hhtth,  €2  »  tthtt  and  €3  =  hhhttt;  and  instead,  or  in  addition  to,  N  we  can  count 
the  number  of  doublets  ht. 

For  the  model  {N;hh  or  tt}  the  fork-equations  are 

\|;(N,a)  =  pilr(l+Nh,ah)  +  q\Kl+Nt  ,at)  (5.26) 

'KNh.Sh)  =  piKl.l)  +  qv|;(l+Nj,at)  (5.27) 


•  h 
[t] 


\lf(Nt,at)  =  p\if(l+Nj,,ah)  +  q\lf(l,2) 


(5.28) 


Thus,  along  with  (N,a)  we  analyze  (Nh,ah)  and  (Nj,aj);  this  is  in  the  nature  of  the 
furcation  method.  From  (5.26),  (5.27)  and  (5.28)  we  can  derive  the  distribution  and 
the  moments  of  interest.  We  can  with  ease  transpose  these  equations  into  equations  in 
(H,T,a),  (Hjj.Th.a^)  and  (Hj,Tj,at).  We  can  also  derive,  with  some  fair  amount  of 
algebra,  the  uncoupled  equations  (5.29),  (5.30)  and  (5.31): 

\|/(N,a)  =  pq\if(2+N,a)  +  p^vlf(2,l)  +  pq^\j;(3,2) 

+  qp^i|f(3,l)  +  q^\K2,2)  (5.29) 

^(Nhah)  =  Pq'K2+Nh,ah)  +  pv|/(l.l)  +  q^^(2,2)  (5.30) 

\l/(Nj,at)  =  pq\|;(2+Nt,at)  +  p^\K2,l)  +  q\l/(l,2)  (5.31) 

From  (5.29),  which  is  upright  with  respect  to  the  N-argument,  we  get  easily 
(5.32): 

\l/(H,T,a)  =  pq\|;(l+H,l+T,a)  +  p^\K2,0,l) 

+  pq"\k(l,2,2)  +  qp^v|/(2.1,l)  +  q^^(0,2,2)  (5.32) 

We  can  do  the  same  for  (5.30)  and  (5.31),  and  we  can  likewise  transpose  (5.22), 
(5.23)  and  (5.24)  each  into  a  joint  trivariate  probability  distribution.  The  interested 
reader  should  have  no  difficulties  doing  it. 
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